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1.  INTRODUCTION 


The  articles  that  are  being  surveyed  in  this  paper  were  obtained 
with  the  aid  o£  a  computerized  search  of  two  data  bases:  GEOREF  from 
Dialog  Information  Services  and  COMP  from  BRS  Information  Technologies. 

In  the  GEOREF  data  base,  there  are  937  articles  written  by  Russian 
scientists  on  various  topics  in  remote  sensing  during  the  years  1785 
through  1985.  Between  1980  and  1985  alone,  there  are  532  articles.  I 
have  decided  to  search  the  GEOREF  data  base  during  the  years  1980-1985 
which  yielded  159  articles.  The  COMP  data  base  gave  a  list  of  123 
articles  written  by  Russian  scientists  in  remote  sensing  during  the 
years  1976-1985.  A  printout  of  282  article  titles,  their  authors,  and 
abstracts  was  obtained  for  the  initial  investigation. 

There  is  no  way  to  specify  that  we  are  interested,  say,  in  mathe¬ 
matical  modeling  in  remote . sensing  and  obtain  a  complete  list  of  the 
desired  articles.  One  had  to  read  the  282  abstracts  and  decide  which  ones 
seemed  to  be  relevant  to  this  study.  Out  of  the  282  articles,  only  63 
appeared  to  have  any  statistical  analysis  of  the  observed  data.  After 
reading  the  63  articles,  58  were  found  to  be  of  interest.  By  inspecting 
the  recent  issues  of  the  most  important  journals  of  the  Russian  literature 
on  remote  sensing  (to  be  discussed  below)  19  references  were  added,  for  a 
total  of  77  articles.  The  most  striking  fact  is  that  only  4 2  of  the 
articles  are  not  translated  into  English  (3  out  of  77  in  our  reference  list). 

There  are  three  journals  that  account  for  approximately  90%  of  all 
the  articles  in  our  reference  list.  One  can  safely  state  that  the  research 
published  in  these  three  journals  gives  a  clear  picture  of  the  remote 
sensing  problems  studies  by  Russian  scientists.  These  three  journals  are: 
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1.  Soviet  Journal  of  Remote  Sensing,  the  English  translation  of 
Issledovaniye  Zemli  iz  Kosmosa. 

2.  Izvestiya,  Atmospheric  and  Oceanic  Physics,  the  English 
translation  of  Izvestiya,  Akadenrii  Nauk  SSSR,  Fizika  Atmos phera 
Okeana. 

3.  Soviet  Meteorology  and  Hydrology,  the  English  translation  of 
Meteorologiya  i  Gidrologia. 

Several  comnents  about  these  three  journals  are  in  order.  The  English 
translation  of  the  first  journal  appears  from  1980,  the  second  journal 
from  1965,  and  the  third  journal  from  1976.  It  was  not  clear  at  all  from 
the  computerized  search  that  the  Sov.  J.  of  Remote  Sensing  is  indeed  the 
English  translation  of  Issledovaniye  Zemli  iz  Kosmosa.  as  the  first  name 
appeared  in  GEOREF  and  the  second  in  COMP.  Moreover,  it  was  not  stated 
that  Izvestiya,  Akademii  Nauk  SSSR,  Fizika  Atmosphera  Okeana  has  been 
translated  into  English,  and  all  the  references  were  given  in  the  data  base 
for  the  original  articles  in  Russian. 

We  now  proceed  to  give  a  brief  description  of  the  organization  of 
this  review  article.  In  the  second  section  of  this  article,  we  will 
review  the  major  areas  of  remote  sensing  and  state  the  statistical  techniques 
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that  were  used  in  each  of  the  areas.  In  Section  3,  we  will  discuss  briefly 
each  of  the  statistical  techniques  that  have  been  mentioned  in  Section  2. 
Then,  we  will  conclude  our  survey  with  a  short  summary  followed  by  a 
bibliography. 

2.  AREAS  OF  APPLICATION  OF  REMOTE  SENSING  TECHNIQUES 

The  articles  included  in  this  study  can  be  classified  into  four  major 
areas:  Agriculture,  Geology,  Meteorology,  and  Oceanography.  The  percentage 
breakdown  of  the  total  number  of  articles  in  each  area  is:  21%  (16  out  of 
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77),  21%  (16  our  of  77),  39%  (30  out  of  77),  and  19%  (15  out  of  77),  respectively. 
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We  now  proceed  to  describe  Che  types  of  problems  studied  in  each  of  the 
areas  mentioned  above  and  list  the  statistical  techniques  that  have  been 
used  to  analyze  the  observed  data. 


a.  Agriculture .  The  following  topics  are  discussed  in  this  area 
of  application  of  remote  sensing  techniques:  evaluating  the  status  of 
farmland  by  monitoring  the  optical  parameters  of  the  atmosphere  ([2],  [11], 
[43]),  estimation  and  identification  of  the  states  of  vegetative  cover 
from  airborne  observations  ([12],  [39]),  estimating  biomass  of  grass  and 
semishrub  vegetation  ([75]),  evaluation  of  the  accuracy  of  land  class 
determination  and  forest  sites  ([29]),  interpretation  of  aerospace  photo- 
graphs  of  forests  ([4],  [23]),  analysis  of  canopy-soil  optical  reflectance 
([30],  [54],  [77]),  and  radiation  studies  of  soils  and  vegetative  covers 
([5],  [40],  [67]). 

We  now  proceed  to  list  the  statistical  techniques  employed  in  these 
articles:  point  estimation  of  means  and  standard  deviations  ([2],  [5], 

[12]),  classification  analysis  ([4],  [29]),  discriminant  analysis  ([23], 

[29]),  cluster  analysis  ([73]),  analysis  of  variance  ([30]),  regression  analysis 
([5],  [11],  [12],  [39],  [75]),  stochastic  modeling  ([40],  [43],  [54],  [67], 

[75],  [77]),  simulation  ([40],  [67],  [77]),  spatial  analysis  ([77]),  and 
spectral  analysis  ([5]). 

b.  Geology.  The  following  topics  are  discussed  here:  earth  resources. 
surveying  ([6],  [52],  [56],  [66],  [68]),  choosing  the  spectral  zones  to 
distinguish  the  natural  objects  on  the  earth  surface  by  their  spectral 
characteristics  ([9]),  determining  the  state  of  ground  objects  ([10]),  earth 
exploration  by  using  optical  imaging  systems  ([17],  [27],  [70],  [72]), 


geological  investigation  from  observations  made  from  space  ([20],  [42], 
[71]),  developing  and  controlling  an  earth  sensing  space  system  ([S3]), 
and  formulating  a  signal-to-noise  ratio  concept  for  a  spaceborne  photo¬ 
graphic  system  ( [49] ) . 

We  now  list  the  statistical  techniques  used  in  these  articles: 
classification  analysis  ([10],  [27],  [42],  [52],  [71]),  discrim  nant 
analysis  ([27]),  econometrics  ([6],  [S3]),  data  validation  ( { S2  ), 
filtering  theory  ([20]),  information  theory  ([49]),  modified  MLE  ([10]), 
nonlinear  estimation  ([10]),  optimal  sampling  ([9],  [70]),  optimal  decision 
control  ([53]),  optimal  linear  prediction  ([56]),  stochastic  modeling 
([6],  [17],  [66],  [68]),  spatial  analysis  ([20],  [72]),  and  spectral 
analysis  ([72]). 

c.  Meteorology  .  The  following  topics  are  discussed  in  this  area 
of  application  of  remote  sensing  techniques:  statistical  characteristics 
of  the  spatial  thickness  of  the  atmosphere  ([1]),  determining  integral 
sioisture  content  in  a  cloudless  atmosphere  ([3],  [47]),  temperature-wind 
atmosphere  sounding  by  radioacoustic  methods  ([7],  [15],  [21]),  scattering 
of  radar  signals  by  clouds  and  percipitation  ([8]),  examining  the  informa¬ 
tiveness  of  serological  and  remote  sensing  systems  ([13],  [14],  [28],  [51]), 
mathematical  models  for  cloud  cover  ([16],  [50]),  interpreting  the 
characteristics  of  the  outgoing  thermal  radiation  field  of  earth-atmosphere 
system  ([7],  [58],  [59],  [60],  [61],  [62],  [63],  [65])  construction  of 
snow  cover  maps  from  data  provided  by  meteorological  satelites  ([25]), 
estimating  the  volume  coefficient  of  absorption  by  aerosols  from  airborne 
measurements  of  spectral  fluxes  ([31]),  determining  the  earth's  surface 


temperatures  and  weather  forecasting  ((32],  [57],  [64],  [74]),  automated 
analysis  of  aerospace  photographs  in  monitoring  polution  ([38]),  dependence 
of  signal  processing  on  optical  parameters  ([41]),  laser  beam  intensity 
fluctuations  spectra  in  atmospheric  precipitation  ([76]). 

The  statistical  techniques  that  have  been  employed  to  analyze  the 
data  in  articles  mentioned  above  are:  point  estimation  of  means  and 
standard  deviations  ([7],  [8],  [25],  [31],  [41]),  classification  analysis 
([38],  [51]),  cluster  analysis  ([16],  [38]),  inverse  problems  in  linear 
and  nonlinear  models  ([47],  [58],  [60],  [61],  [63],  [64]),  multivariate 
estimation  ([14],  [51],  [63],  [64]),  principal  component  analysis  ([1], 
[28]),  regression  models  ([3],  [32],  [47],  [50],  [57],  [58],  [62],  [63], 
[65]),  sampling  techniques  ([13]),  simulation  ([16],  [41],  [47]),  spatial 
analysis  ([22],  [61],  [62],  [64],  [76]),  spectral  analysis  ([57],  [76]), 
filter  theory  ([22],  [59],  [63]),  stochastic  modeling  ([15],  [16],  [21], 
[50],  [59]). 

d.  Oceanography.  The  following  topics  are  studied  in  this  area: 
radiometric  discrimination  of  sea  ice  ([18]),  remote  radiometric  methods 
of  investigating  sea  surfaces  ([19],  [35],  [37]),  interpretation  of  remotely 
sensed  ice  cover  data  from  its  thermal  emission  ([24]),  acoustic 
tomography  of  the  ocean  ([26]),  analysis  of  remotely  sensed  data  of 
internal  waves  ([33],  [44]),  spatial  structure  of  the  radiance  field  for 
the  ocean-atmosphere  system  ([34]),  analysis  of  the  structure  of  sea  surface 
temperature  field  ([36],  [45],  [46]),  year-to-year  variation  of  the 
radiation  budget  of  ocean  surface  ([48]),  optimal  procedures  for  determining 
the  sea  surface  and  atmospheric  parameters  ([55],  [69]). 
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The  statistical  methods  discussed  in  these  articles  are:  point 
estimation  of  means  and  standard  deviations  ([19],  [44]),  classification 
analysis  ([18]),  discriminant  analysis  ([37]),  regression  models  ([37], 
[45],  [46],  [55],  [69]),  optimal  design  ([26]),  spatial  analysis  ([34], 
[35],  [36],  [45],  [46],  [48]),  spectral  analysis  ([33],  [34],  [48]), 
stochastic  modeling  ([24],  [36],  [37],  [46],  [69]). 


3.  STATISTICAL  TECHNIQUES  USED  IN  REMOTE  SENSING 


In  this  section  we  briefly  describe  some  of  the  statistical  techniques 
that  have  been  used  in  the  current  Russian  literature  on  remote  sensing, 
a.  Estimation  ( [2]  ,  [5],  [7],  [8],  [10],  [12],  [14],  [19],  [25], 
[31],  [41],  [44],  [51],  [74]). 


The  references  cited  above  do  not  include  articles  that  discuss  esti¬ 
mation  for  regression  or  spatial  models.  Most  of  the  estimation  techniques 
used  here  are  quite  elementary,  though  in  some  cases  the  mathematical  models 
that  describe  the  physical  phenomena  can  be  quite  involved  and  far  beyond 
the  scope  of  knowledge  of  a  statistician. 

In  [2]  the  authors  estimate  the  var(J),  where  J  ■  Ij/l2  and  and  I2 

are  the  radiances  at  wavelengths  X^  and  X^,  respectively.  Optical  para¬ 
meters  are  related  to  radiance  by  the  following  equation:  I  =  Iq  exp(-f), 
where  t  is  the  optical  thickness  of  the  atmosphere.  For  the  two  wavelengths 
X^  and  X^  the  variance  covariance  matrix  for  and  is  assumed  to  be 
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o, ,  cr  R  ,  where  R,_  is  the  correlation  coefficient. 

11  22  12  12 


2  2 

®u  °22-  .  ■, 

(^  T2 

The  authors  use  the  fact  J  *  .Kt^.t^)  “  e  and  use  the  first  term  of 


the  Taylor  series  expansion  for  evaluated  at  (E(t1>,  E(t2>)  to 


obtain  an  approximation: 


var(j)  :  e 
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For  several  cities  in  Russia  E(J)  and  the  approximation  for  var(J)  are 

*  » 

estimated.  The  quantity  e  *  o^/ECJ)  is  used  as  an  estimate  for  the 
maximum  relative  error  in  their  estimation  procedure.  In  the  paper  itself 
the  authors  define  e  =  Oj /J. 


In  [10]  a  uniform  surface  M  is  partitioned  into  a  large  number  of 
elements,  N,  called  macroelements.  Each  of  these  N  elements  is  divided 
into  elementary  cells,  called  microelements,  which  characterize  the  specific 
properties  of  an  object  in  H.  Without  loss  of  generality  it  is  assumed 
that  the  number  of  microelements  is  constant  and  equal  to  its  mean  value 
n  >>  1.  The  state  of  each  microelement  is  described  by  the  vector  x  with 
pdf  P(x).  The  radiance  of  a  microelement  is  described  by  y  and  it  is 
assumed  that  y  *  f(x)  for  some  function  f.  In  this  paper,  the  authors 
treat  x  and  ^  as  scalars. 

How,  if  y.  and  x.  are  the  radiance  and  state  variables,  respectively, 

1  n  1  n 

of  an  xth  microelement  in  a  macroelement,  then  "Y  *  —  t  y.  and  X  *  —  l  x. 

n  ...  'i  n  .  ,  i 

i*l  1*1 

are  the  mean  radiance  and  state  vector  of  an  object  for  an  arbitrary 
macroelement.  The  stochastic  model  for  Y  as  a  function  of  X  is  discussed 
via  the  regression  curve  of  Y  on  X  and  the  residual  variance  after  regression 
The  central  problem  that  concerns  the  authors  in  this  paper  is  to 
estimate  the  moments  of  X  (most  often  the  first  two)  given  the  data  Y^.Y^,... 
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They  use  the  maximum  likelihood  approach  and  consider  the  following  log- 
likelihood  equation: 

N 

I  v  log  P(Yi,a1, . . . ,aL,y1> . . . ,yM^)/vyk  *  0  ,  k  ■  1,2,..., M  <  M*  , 

where  a^  are  the  coefficients  in  the  series  used  to  approximate  the  function 
f  (x) : 

L-l 

f(x)  =  I  a  . 
q=0  * 

P(Y)  is  substituted  into  the  log-likelihood  equation  as  an  Edgeworth 
series  with  M  terms.  Hence  only  M  out  of  M*  parameters  can  be  estimated  and 
^M+l  *  *  ‘  *  ,VM*  are  t*ie  nuisance  parameters.  In  this  article  the  nuisance 
parameters  are  expressed  as  functions  of  y^,...,y  .  These  functions  are 
chosen  to  maximize  the  entropy  of  the  P(x)  distribution: 


S  -  -  /  P(x)  log  P(x)dx. 


To  study  the  precision  of  the  estimates  obtained  using  this  approach  the 


authors  suggest  evaluating  the  variance  for  the  estimate  due  to  y£  moment 


fluctuation : 


_  00  S(y  )  n  „ 

3  c  /  e  1  (u£-yp  (vyk/vy£)2dy£ 
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where  k  *  1,...,M,  l  ■  M+1,...,M*,  y£  is  the  fixed  value  assigned  to  y£ , 
c  is  a  normalizing  constant  and  v  denotes  the  partial  derivative... 

At  the  end  of  the  article  the  authors  consider  distributions  from  the 


exponential  class  as  an  approximation  for  P(Y): 
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where  Y 


is  a  standardized  value  of  Y.  They  use  the  Cramer-Rao 


inequality  to  estimate  the  minimal  variance  of  the  estimates  for  p. , . . . ,pu( 

1  M 

obtained  by  the  modified  maximum  likelihood  procedure  discussed  above. 

In  [41]  Monte  Carlo  techniques  were  used  to  study  the  degree  of 

polarization  of  the  multiple  scattered  background  reflected  by  clouds.  The 

dependence  of  total  signal  5,  (x)  on  the  optical  sensing  depth  t  and  the 

b 

characteristic  parameter  n,  which  determines  the  optico-geometric  illumina¬ 
tion  conditions,  had  been  studied.  The  depolarization  of  second-scattering 
signal  was  estimated  as  a  function  of  t  and  n.  Standard  error  for  this 
estimate  was  computed. 

The  authors  conclude  that  at  an  optical  depth  t  :  1  and  x  <_  .02,  the 
double  scattering  signal  represent  about  80%  of  the  entire  background 
for  both  cloud  models  considered.  As  x  increases  as  well  as  the  penetration 
depth  of  the  radiation  into  the  cloud,  the  fraction  of  the  double  scattering 
signal  in  the  relected  background  drops  off  significantly. 

In  [44]  the  authors  investigate  how  the  internal  wave  modifies  the 
structure  of  the  surface  wave  motion  and  how  and  under  what  conditions  these 
changes  will  influence  the  image  of  the  ocean  surface. 

Let  the  image  of  solar  glitter  at  the  ocean  surface  form  in  a 
receiving  device  located  at  height  h  above  the  sea  level.  The  receiving 
device  consists  of  a  large  number  of  elementary  receivers  with  narrow 
directivity  patterns,  forming  together  the  viewing  field  of  the  instrument. 
Under  various  assumptions  the  authors  state  that  a  random  occurrence  of 
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In  [74]  the  geopotential  field  in  the  "true"  atate  at  time  t  -  t^ 
is  denoted  by  H.  -  H(tJ  and  the  observed  geopotential  values  by 
X.  *  X(t .).  The  observed  data  consists  of  xq,xi* ' ' ' ,Xk  at  times 
r  <  t,  <...<  t.  .  The  authors  are  interested  in  predicting  IL  . 

Assuming  that  PC^I^)  -  p<^klHk»x0*  •  •  •  ^-l^  the  authors  obtain 
the  following  representation: 

(1)  p(hJx0 . ^  *  const-  p(lhci^o» '  *  *  ,3Sc-l>P(^lc * 

They  use  (1)  to  estimate  the  true  state  of  H  at  time  t  *  t^  from 

Xq.Xj...  .  .X^  by  choosing  that  maximizes  (1).  To  evaluate  P^l^), 

they  assume  that  X.  *  B.H.  ♦  N..  where  B.  is  the  time-variable  conversion 
7  -J  J-J  -J  J 

matrix  and  N.  is  the  random  error  independent  from  H. 's.  E(N.)  ■  0  and 

E(N.nT)  *  C„6(t.  -  t.)  is  the  covariance  matrix.  Then 
~i~j  N  l  j 

(2)  P<*Jy  -  const.  .*»<- 

To  derive  the  formula  for  p(\ lx0»  •  •  •  *^-1^'  the  authora  aasu,De  that  the 
geopotential  information  for  times  t  <  t^  can  be  accumulated  in  the  geo¬ 
potential  forecast  calculated  for  time  t  ■  t^  from  Xq,X^ , . . . ,X^_^ . 
Hence,  we  use  the  following  approximation: 

and  approximate  it  by  a  multivariate  normal  distribution: 

o)  Ft^icl^o*  *  *  *  ’iSt-i5  :  const*  exp{'  %(?k“?k)T(ck)"l(!k“!k)>* 

where  is  the  forecasting  error  covariance  matrix.  Substituting  (2)  and 

(3)  in  (1)  yields: 

(4)  p(Hk|x0 . *  const.  exp{-  %(Hk~H^)T(C^) 


is  estimated  by  maximizing  (4)  (provided  we  know  and  C^): 

k  -  £  ♦  vh  - 


where 


f  T  f  T  -1 

K.  »  c  B  (B  C  B  +  c  ) 

\  kkkkk  N 


The  authors  discuss  how  to  estimate  C,  and  C„. 

k  N 


b.  Classification  and  Discrimination  ([4],  [10],  [18],  [23],  [27], 


[29],  [37],  [38],  [42],  [51],  [52],  [71]). 

In  this  paper,  [18],  a  Bayesian  classification  rule  is  used  for 
radiometric  discrimination  of  sea  ice  types. 

Let  x  ■  (x.,..., x)  be  the  vector  of  characteristics  of  a  given  image. 
P(ok|x)  denotes  the  conditional  probability  of  the  image  to  be  classified 
in  class  ui. ,  given  it  corresponds  to  the  observed  characteristics  x, 

i  ■  1,2 . M.  If  L^j  is  the  loss  that  one  suffers  if  an  image  that  belongs 

to  class  j  was  classified  into  class  i,  then  the  mean  value  of  losses  due 
to  misclassification  is  given  by 


l  L. .P(xlu.)P(u.) 
i-1  1J  1  1 


T.(x)  *  I  L..P(oj.|x)  ■  M 
J  '  i«l  1  ~  I 


P(x|wfc)P(u)k) 


where  P(uO  are  a  priori  probabilities  that  the  object  chosen  at  random 
belongs  to  class  The  classification  rule  that  minimizes  the  total  losses 

assigns  the  image  x  to  class  uk  if 

x^(x)  <  Tj(x)  ,  i  +  j,  j  *  1 , . . . ,M. 

Under  the  assumption  of  0-1  loss,  the  above  translates  to 

P(x |u.  )P(u>.  )  >  P(x|u.)P(u>  )  . 


Experiments  on  radio  thermal  radiation  of  sea  ice  of  different 
types  show  that  the  mean  values  of  brightness  temperatures  at  two  wave¬ 
lengths  (1.6  and  3.2  cm.)  can  be  used  as  characteristics.  Empirical 
probability  distribution  for  brightness  temperatures  were  constructed 
for  each  of  the  ice  types  and  wavelengths.  These  were  used  to  approximate 
P(x|u.).  The  P(u. lx)  were  then  evaluated  and  used  in  the  classification 
procedure.  To  improve  the  radiometric  discrimination  of  ice  types,  the 
authors  found  it  is  necessary  to  use  a  digital  nonlinear  lowpass  filter, 
which  smooths  only  those  values  of  brightness  temperature  which  differ 
from  the  current  mean  value  with  not  more  than  a  specified  value  e. 

They  also  come  to  a  conclusion  that  using  infrared  channel  data  makes  it 
possible  to  distinguish  thin  ice. 


In  [29]  the  authors  are  proposing  a  procedure  to  evaluate  a  classification 
rule  in  forestry  studies. 

Assume  that  there  are  N  classes  of  features  to  be  recognized  in  a 
given  image.  Define  for  each  ith  class  n^  fixed  test  sites  of  equal  area 
and  group  them  into  two  sequences:  a  reference  sequence  of  n^  sites  and 
a  control  sequence  of  n^c  sites.  The  reference  sample  will  be  used  to 
define  the  classification  procedure  and  the  control  sample  to  evaluate  its 
accuracy.  For  the  control  sample,  each  observation  receives  three  scores: 
correct  classification,  misclassification  or  omission  (the  computer  cannot 
decide) . 


Suppose  that  of  all  n.c  sites  from  the  ith  class  presented  to  the 

computer  n  were  classified  correctly,  n  misclassif ied  and  n  omitted, 
p .  a .  r . 

i  Mi  l 


with  n  +  n  ♦  n  *  n.  .  Define  p.  -  (n  /n.  )100  as  percent  of 
p.  q.  r.  ic  *i  p.  ic  r 

1  ^1  l  ri 


correct  classif icationsm  q.  ■  (n  /n.  )100  as  percent  misclassif ications 

^i  q.  ic  r 


and  r.  =  (n  /n.  )100  as  percent  of  omissions.  Relative  misclassif ication 
l  r .  ic 

l 


of  class  i  with  respect  to  all  j  f  i  classes  i6  defined  as  the  sum  of 
misclassification  rates 


N 

Z  q.  .  , 


1  3-1 


where  q„  is  percent  data  from  class  i  that  was  misclassif ied  into  class  j. 


The  authors  define  a  decision  matrix,  denoted  by  n,  that  gives  the  computer 
recognition  decisions  for  all  the  sites: 


A  •  A  m  M  • 

pi  ql2 


nqlN  "ri 


nq21  np2  •••  nq2K  nr2 


|_  "qNl  nqN2 


npN  nrN  jNx(N-l) 


N 

Define  n  -  In.,  the  total  number  of  correct  classifications  and 

P  j-!  PJ 

N 

n  *  I  n.  the  total  number  of  control  sites.  Then  p  =(n  /n  )100  is 
c  j=l  Jc  PC 

the  average  percentage  of  correct  classifications  in  a  given  test.  By 
adding  the  decision  matrix  the  sum  row  containing  n^,  n^  and  p,  we  obtain 
what  the  authors  call  an  "extended  decision  matrix." 

This  extended  decision  matrix  is  used  to  compare  two  classification 


techniques  that  are  based  on  space  scanner  image  processing:  single  and 
two-step  color  selection.  The  single  step  color  classification  method 
interprets  in  the  interactive  mode  multiband  and  false-color  forest  images 


with  the  aim  of  identifying  various  forest  areas  differing  by  the 
dominant  species.  With  this  method  the  optical  densities  of  registered 
band  image  elements  provide  color  signatures  for  the  features  to  be 
recognized . 

Let  x  *  (x^ . Xjj)  be  the  N-ditnensional  vector  of  color  signatures 

where  N  is  the  number  of  band  images.  The  classification  rule  used  is  the 
same  as  in  [18]  for  the  0-1  loss  function.  The  discriminant  function  for 
the  kth  and  jth  class  is  given  by 

F  -  P(u>  )P(x|u .  )  -  PU.)P(x|u>.) 

KJ  K  ~  K  J  *  J 

which  is  positive  for  x's  that  lead  to  a  classification  into  the  kth  class 
Since  P(x|oj^)  is  usually  unknown,  the  following  nonparametric  estimate  is 
used : 

.  M  ,  x  -  x.  . 

P(xk)  =  i  l  ~  v(~-~r---) 

“  k  M  i-1  VM  h 

where  M  is  the  length  of  the  reference  sample,  VM  is  the  volume  of  the 
elementary  hyperwindow, 


x  -  x,  . 

vC^-TT*)  " 


1  for 


0  otherwise 


and  h  is  the  window  side.  Hence 


P(u,  )  M 


F.  .(*) 


\  )  M  ,  x-x,  . 

r  h-  v(-Si)  - 


V 

i=l  M 


P(w.)  M  ,  x-x,  . 

- L.  Z  1-  v(^i)  . 

M  VM  h 

1=1  M 


The  size  h  is  selected  to  match  the  length  of  the  sample:  it  decreases  for 

larger  M.  With  integer  valued  signature  ■  1  and  hence 

M  M 

F.  .  =  P(*.  )  I  v(x  -  x.  . )  -  P(u.)  I  v(x  -  x..)  , 


Assume  that  P,1 .  are  known  sampled  values  of  the  respective  P.  .(x.,t). 

kj  kj  'i 

Each  observed  result  is  associated  with  an  n-ditnensional  message  vector 

S,  such  that  its  ith  coordinate  is  equal  to  j  if  the  ith  source  generated 

the  message  v..  There  are  N  =  Mn  different  vectors  S^,  q  =  1,...,N. 

With  each  S  the  authors  associate  the  Bayes  a  posteriori  probabilities 
~q 


P(uJS  )  =  ~77 

“Tel  _q  M 


P<-k)P(§q1Uk) 


l  P(w,  )P(S  u.  ) 
k  -q1  k 


Since  the  data  sources  are  independent,  then 


p(s  U,  )  =  n  p.1.  , 
~q‘  k  kJ 


j =  v 

where  S  is  the  ith  coordinate  of  S  .  Let  v.,  j  =  1,...,M,  be  the 
q  ~q  J 

messages  at  the  DPS  output,  meaning  the  object  is  in  state  w ^ •  Subject  to 
several  assumptions,  the  authors  derive  the  probability  that  the  generated 
message  at  the  DPS  output  is  when,  in  fact,  is  the  true  state: 

N 

P(v.|u.  )  *  I  P(v. js  )P(S  |u  ). 

J  k  =1  J 1 ~q  -q 1  k 

The  DPS  information  matrix  Q  is  defined  as  Q  =  [P(v.|l:  The  diagonal 

j  k  MXM 

elements  of  Q,  are  the  conditional  probabilities  of  correct 

classifications.  W(o)  denotes  the  data  validity  as  a  function  of  p.  =  P(w.) 

3  3 

is  defined: 

M 

W(p)  =  T.  p  ,  P(v  .  Jw  . ) . 
j-1  J  J  J 

Since  P(v.|ui.)  are  determined  by  the  entries  Q.  via  the  formulas  for 
J1  J  1 

2 

P(v,  |S  )  and  P(S  | cu,  ) ,  W  is  a  function  of  M  *  n  +  II  variables.  The  authors 


investigate  W(p)  for  the  case  M  ■  2  and  n  =  2  for  various  matrices  Q^. 

They  observe  that  W(p)  is  minimum  for  p  *  (A»  •  • • »n)  and  that  it  varies 
slowly  about  this  minimum  value  if  n  >  2.  They  suggest  using  this  lower 
bound  as  an  approximation  for  the  data  validity. 

c.  Cluster  Analysis  ([16],  [38],  [73]). 

In  [73]  a  technique  is  presented  for  clustering  farmlands  on  the  basis 
of  spectral  radiance  measurements  using  the  relative  spatial  position  of 
fields  as  an  auxiliary  variable.  The  following  assumption  is  made:  two 
adjacent  fields  occupied  by  cultivated  vegetation  belong  to  different 
classes. 

Assume  that  n  measurements  of  spectral  radiance  have  to  be  clustered. 

Internal  structure  links  are  introduced  so  that  the  set  of  measurements 

are  ordered.  This  representation  of  the  measurements  creates  a  graph 

G  ■  (V,E),  where  V  *  {0, ,0_ , . . . ,0  }  is  the  set  of  vertices  and 

i  z  n 

E  ■  {£,,£„,...,£  }  is  the  set  of  edges.  If  the  set  E  is  ordered  as 
i  z  m 

l.  <  £,  <  ...  <  a  i  then  G  is  an  ordered  graph.  0.  e  V  are  referred  to 
i  —  Z  —  —  m  i 

as  objects  and  £^  e  E  are  the  links.  A  proximity  graph  P  =  (V,E)  is  an 

ordered  graph  where  V  is  the  set  of  objects  to  be  clustered  and  E  is  the 

set  of  links.  The  links  £.>£.,...,£  may  have  different  meanings: 

1  Z  n 

1.  Ruling  out  the  inclusion  of  points  of  an  edge  in  the  same  cluster. 

2.  Allowing  the  objects  of  an  edge  to  be  in  the  same  cluster  but 

requiring  the  ordering  relation  of  the  links  to  be  examined. 

If  £.  *  0  0  and  £ .  *  0  0  ,  then  £.  <  £.  implies  that  the  degree  of 
l  p  q  3  r  s  i  -  J 

similarity  between  0  and  0  is  at  least  the  same  as  between  0  and  0  . 

*  n  r  c 


V' 


20 

The  authors  have  used  this  graph  theoretic  clustering  procedure  for 
a  farmland  inventory  from  multispectral  space  and  air  photos.  A  territory 
of  the  Wroclav  province  in  Poland  with  213  farms  was  used  in  this  study. 

The  average  values  of  optical  densities  were  determined  for  each  field. 

A  proximity  graph  was  then  constructed  from  these  densities.  Based  on 
this  technique  the  213  fields  were  clustered  in  13  clusters.  The  authors 
have  found  this  technique  to  be  very  satisfactory  in  solving  practical  crop 
inventory  problems. 


d.  Sampling  ([9],  [13],  [70]). 

This  paper,  [9],  discusses  a  procedure  for  selecting  spectral  channels 
for  collecting  remote  sensing  data  on  the  basis  of  Kullback  information 
measure . 

Let  x  be  an  observed  vector  of  spectral  brightness  coefficient  in  n 
different  channels.  Based  on  these  measurements  various  objects  have  to  be 
discriminated.  For  the  sake  of  simplicity  the  authors  assume  that  there 
are  two  classes  of  objects,  whose  features  are  described  by  pdf's  f^(x)  and 
f^(x).  The  Kullback  information  measure  for  difference  between  f^(x)  and 
fj(x)  is  defined 

fT  (x) 

J ( 1 ; 2 ,x)  =  J  { f ^ ( x)  -  f^Cx)}  in  j  ^  dx  . 

The  authors  assume  that  f.(x)  has  a  multivariate  normal  distribution  with 

l  - 

mean  and  covariance  matrix  1^,  i  *  1,2.  They  evaluate  the  increase  in 
J(l;2,x)  when  n  is  increased  to  n  +  1 ,  denoted  by  AJ(l;2,x).  The  authors 
proceed  to  investigate  the  conditions  under  which  AJ(l;2,x)  =  0.  Assuming 


d1  ,  *  covCX1^1  ,)  ;  1  (in  fact,  they  could  have  assumed  that 
n,n+l  n  n+1 


cov(  Z  a.X.,X  )  :  1),  in  order  to  evaluate  AJ(l;2,x)  one  has  to  perturb 
•  •  xx  n^l 
i=l 

the  n+1  dimensional  covariance  matrices  by  XI,  where  X  >  0  and  I  is  a 
(n+1)  x  (n+1)  identity  matrix  and  at  the  end  let  X  +  0.  The  condition  of 
AJ(l;2,x)  *  0  at  X  •+  0  is  that 


V1  =  Y2  =  H*1 

i  rn 

°n+l  n+1 

- r2 -  is  the  ratio  of  standard  deviations  for  X  ,  and  X 

l  n+1  n 

'  o 

n  ,n 


where  y 


with  respect  to  f*  and  r.  9  E(x})  -  E(X2).  Relation  (1)  makes  it  possible 

J  J  J 

to  select  the  spectral  channels  that  contain  new  information.  A  threshold 
is  defined  beyond  which  the  addition  of  channels  does  not  provide  any 
significant  information. 


e.  Stochastic  Modeling  and  Simulation  ([6],  [15],  [16],  [17],  [21], 

[24],  [36],  [37],  [40],  [41],  [43],  [46],  [47],  [50],  [54],  [59], 

[66] ,  [67] ,  [68] ,  [69] ,  [77]). 

In  [16]  the  author  is  interested  in  modeling  spatial  and  time  variations 
in  the  distribution  of  cloud  cover  over  Earth's  surface.  The  authors  state 
that  for  distances  r  less  than  3-5  thousand  kilometers  and  time  intervals  x 
less  than  4-6  days,  the  successive  observations  are  correlated.  In  satellite- 
based  earth  observations,  the  distribution  of  cloud  cover  over  the  earth's 
surface  may  be  defined  as  a  discrete  space-time  random  field  U(x,^,t), 
which  takes  on  values:  0  if  there  is  no  cloud  cover  over  the  observation 


point  with  geographical  coordinate  X,  y  and  time  t,  and  1  if  cloud  cover 


t  units  of  time  and  G(r)  ■  [g ^ ^ C r ) ]  is  the  same  matrix  for  different  regions 
separated  by  a  distance  r. 

The  matrix  Q(t)  is  derived  by  solving  the  Kolmogorov  equation 

3?  -  «F  •  0(0)  m  ln  • 

T 

where  1  =  (1,1,... .1)  and  F  =  [f,  .]  is  the  matrix  of  transition  intensities 

n  kj  nxn 

J  n 

For  k  =  1, . .  .  ,n  and  k  +  j ,  f^  =  1/T^  >  0  and  ffck  =  -  Z  f^  ,  where 

is  the  average  time  during  which  the  process  remains  in  the  state  before 

passing  into  B..  T,  .  is  estimated  from  the  data  and  the  obtained  matrix 
J  kj 

F  is  used  as  a  first  approximation  for  the  derivation  of  Q(t).  The  actual 

values  of  Q(t)  are  obtained  by  solving  an  identification  problem.  Similar 

technique  is  applied  to  obtain  G(r). 

After  Q  and  G  are  derived,  the  author  obtains  the  steady  state 

n 

distribution  for  4>  *  QG:  P.  ,P_,...,P  and  derives  E[B]  =  E  B.P., 

12  n  i.i  1  1 


In  [68]  the  cloudiness  values  over  a  given  surface  point  is  modeled  as 

a  continuous  Markov  process  of  the  diffusion  type,  £  ,  characterized  by  two 

parameters:  the  transport  coefficient  a  and  diffusivity  b,  where  z  is  the 

distance  between  the  initial  surface  point  and  the  current  point  under  study. 

0  <  ^  <  1,  where  0  and  1  are  reflection  points.  The  author  considers  the 

random  quantity  6 ( C z ^  where  6(x)  is  a  twice  continuously  differentiable 

function  on  [0,1].  If  u(x,y)  =  E[0(^z) |Cq  =  x] ,  then  u(x,y)  is  the  solution 

of  the  following  partial  differential  equation: 

.  2 

a  —  +  _  v  u  -  vu 

vx  2  2  vz  ’ 

vx 

with  the  initial  condition  u(x,0)  =  9(x)  and  boundary  conditions  =  'u^-* 

vx  vx 

*  0.  The  solution  u(x,y)  is  obtained  by  using  the  Fourier  method,  and  is  a 


function  of  the  quantity  c  *  ^  and  0(x).  By  selecting  appropriate  functions 
6(x),  the  author  derives  the  cdf  for  cloudiness  over  the  surface  site,  the 
transition  probability  of  the  cloudiness  value  at  a  distance  z  from  initial 
point  from  segment  [a^c^]  to  segment  and  the  correlation  function. 

By  comparing  the  theoretical  formula  for  the  cdf  of  cloudiness  with 
the  empirical  cdf  for  cloudiness,  one  can  obtain  the  value  of  c  =  — .  The 
value  of  b  is  obtained  by  comparing  the  empirical  and  theoretical  values 
of  the  correlation  function.  Thus  the  parameters  a  and  b  of  the  process 
can  be  determined  from  observed  data. 

The  author  goes  on  and  uses  the  above  approach  to  determine  E(t  ), 
where  is  the  random  distance  along  the  ground  track  from  the  initial 
point  where  the  cloudiness  is  equal  to  x,  to  the  point  where  it  reaches  a 
threshold  y. 

f.  Regression  Analysis  and  Inverse  Problems  ([3],  [5],  [11],  [12],  [32], 
[37],  [39],  [45],  [46],  [47],  [50],  [55],  [57],  [58],  [60],  [61], 
[62],  [63],  [64],  [65],  [69],  [75]). 

In  [58],  after  several  assumptions  are  made,  the  authors  propose  the 
following  stochastic  model  of  infrared  radiation  measurements  in  the  present 
of  stratified  patchy  clouds: 

k  k 

(1)  I(v)  -  (1  -  I  o.)I(c)(v)  +  I  a.I^6)(v)  . 

.  ,  l  .,ii 

i=l  1=1 

In  (1),  I(v)  is  the  outgoing  radiation  in  the  spectral  range  centered  around 

(c)  (6) 

the  wave  number  v,  I  (v)  and  I.  being  the  outgoing  at  the  level  p  (at 

1  S  . 

1 

which  the  cloud  layers  are  located),  i  =  l,...,k,  in  a  cloudless  atmosphere 


9*2 


h»3 


and  in  a  completely  overcast  sky,  respectively,  and  a  ■  is 

the  k-dimensional  vector  of  characteristics  of  multilayer  clouds,  each  of 
whose  components  is  uniformly  distributed  over  [0,1].  Next,  the  authors 
proceed  to  express  I  (v)  and  1^  ' (v)  by  radiative  transfer  integral 
equations  that  relate  the  outgoing  radiation  to  the  pressure  at  the  lower 
boundary  of  the  corresponding  cloud  layer  and  the  temperature  as  a  function 
of  that  pressure.  These  integral  equations  are  reduced  to  depend  only  on 
Pq,  the  pressure  at  ground  level.  The  next  transformation  involves  linear¬ 
ization  of  the  resulting  integral  equations  followed  by  finite  dimensional 
approximation  of  the  linearized  equation. 


The  following  model  is  obtained: 


AQ(a)x  +  Ba  +  e  -  f 


where  E(e)  *  0,  S  is  the  sample  estimate  of  E(ee  ),  Q(a)  *  (1  -  1  a.)E 

k  ~  "  i”1  1  m 

+  Z  o.H.,  B  *  (b  , ...,b  ),  b.  "  A(H.  -  E)T,  A  is  an  m  x  m  matrix  describing 

£ae^  ^  ^  a  K  1  X  *** 

the  finite  dimensional  approximation  of  linearized  integral  operators,  E 

m 

is  an  m  »  s  identity  matrix,  are  m  x  m  matrices,  T  is  the  vector  of  sampl 
means,  x^  *  T(p^)  -  ET(pJ  and  f^  is  the  deviation  of  the  measured  outgoing 
radiation  by  remote  sensing  1^  from  the  calculated  one  I ^ . 

A  satellite-borne  detector  collects  data  on  the  outgoing  radiation  at  n 
spectral  channels:  I(v^) , . . . .iCv^)  in  the  present  of  additive  independent 
random  noice  e.  'Based  on  these  data  and  model  (2),  the  authors,  using  ideas 
from  nonlinear  regression  rechniques,  derive  estimates  for  x  and  a.  They 
introduce  y  *  Qx,  neglect  its  dependence  on  a  and  by  assuming  that  y,  x  and 
a  are  deterministic,  derive  the  following  explicit  estimates  for  a  and  x: 

a  *  (BTAAT  +  BBT)“1f 

/  o  \  1 


-I  -  Q‘1(aI)(AAT  +  BBT)_1f  . 


Since  x^  depends  on  f  nonlinearly,  it  is  impossible  to  derive  explicit 

expressions  for  the  covariance  matrix  of  x^.  The  authors  also  claim  that 

these  estimates  are  not  accurate.  To  improve  the  accuracy  of  these  estimates 

the  authors  assume  that  x  and  a  are  random  vectors  whose  first  two  sample 

moments  are  known  x,  R  ,  a  and  R  ,  where  E(x)  *  0.  The  best  linear  unbiased 
_  x  a 

estimates  of  x  and  a  are: 


(4) 


?n  *  V?  -  B“> 


Ojj  *  a  +  Ljf  -  Ba]  , 


where  L  *  R  QT(a)ATG_1,  L  =  R  BTG_1,  G  *  A0AT  +  BR  BT  +  S  and  0  *  Q(a)R  QT(a) 
x  xx  .  ’  .  a  a  a  _xH. 


The  covariance  matrices  for  these  estimates  are  estimated  by: 

(5) 

Model  (2)  and  estimates  (4)  and  (5)  are  extended  for  other  related  problems. 


D(x  )  *  R  -  R  QT(a)ATG_1AQ(a)R 
.II  x  x  -  -  x 

D(a  )  *  R  -  R  BTG_1BR  . 

_II  a  a  a 


In  this  article,  [65],  the  authors  present  the  following  statistical 
model  that  relates  the  vector  measurement  of  radiation  characteristics  y, 
with  the  vertical  ozone  profile  in  the  atmosphere  x: 

y  =  A  [x]  +  £  , 

T 

where  y  =  (y^,...,y  )  (s  =  mk,  where  m  is  the  number  of  spectral  intervals 

and  k  is  the  number  of  space  points  or  directions  per  interval), 

T 

x  =  (x, ,...,x  )  ,  A[x]  is  a  nonlinear  function  of  x,  and  e  the  random  error, 
.in-  - 

The  components  of  c  represent  the  errors  that  occur  in  the  measurements  and 

in  the  model  itself.  We  assume  that  E(e)  =  0  and  SI  is  the  sample  estimate  of 
T 

E(  ee  ).  Moreover,  we  assume  that  we  have  estimated  on  the  basis  of  a  sample 
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x,  y,  R  -  ^  { [x  -  x] tx  -  x]T},  Q  *  j  { [y  -  y] [y  -  y]T}  and 
1  -  -T 

K  *  -  { ty  “  y] [x  -  x]  }.  Let  x  *  E(x)  +  Ax  and  assume  that  Ax  and  e  are 


independent . 

Three  methods,  based  on  the  least  squares  principle,  for  estimating 
x  from  y  are  discussed: 

1.  Formal  Least  Squares  Method.  It  is  assumed  that  (x,y)  has  a 
multivariate  normal  distribution.  If  the  first  two  moments  of  the 
distribution  are  known,  then  the  LSE  has  the  form 

x  »  x  +  KT(Q  +  n)  1[y  -  y]  , 
and  the  covariance  matrix  of  x  is  estimated  by 

R  «  R  -  KT(Q  +  n')~1K  . 

2.  Stochastic  Linearization  Method.  This  approach  assumes 

y  *  b  +  CAx  , 

where  b  *  (b, _ _ ,b  )  and  C  ■  (c..)  are  the  unknown  coefficients.  The 

1  *»  ij 

optimum  choice  of  parameters  is  determined  by: 

b  *  y  and  C  ■  KR  *  . 

In  this  case  the  LSE  of  x  is 

-  x  +  KT(KR_1KT  +  n)_1[y  -  y]  , 

and  the  covariance  matrix  of  x,  is  estimated  by 

A  T  -1  T  -1 

Rj  *  R  -  K  (KR  K  +  Q)  K  . 

3.  Analytic  Linearization  Method.  In  this  case  we  approximate  the 
functional  relationship  between  Ay  and  Ax  by 

Ay  -  A’ [xQ] Ax  +  e, 

where  A'lx^]  i8  the  matrix  of  the  partial  derivatives  at  a  certain  point 
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(sometimes  x^  *  E(x)).  The  LSE  of  x  is 

x2  -  x  +  r(a' )t[a’r(a’)t  +  n]_1ty  -  y] 

and  the  covariance  matrix  of  x2  is  estimated  by 

R2  -  R  -  R(A')T[A,R(A')T  ♦  n]_1A'R  . 

To  evaluate  these  techniques,  the  authors  consider  several  information- 
content  measures: 

(a)  the  Shannon  quantity  of  information  of  the  solution  component  x^ 
that  is  present  in  the  values  of  y,  which  is  given  by 

I(xi,y)  =  log^r^/r^),  and 

(b)  the  multiple  correlation  coefficient  of  x^  and  y  given  by 

*  2  2  r<2 

d.  =  1  -  (r../r..)  ,  where  r. .  and  r. .  are  the  diagonal  elements  of  the 

r  li  n  n  n  ° 

matrices  R  and  appropriate  R,  respectively. 

In  [61]  the  authors  are  concerned  with  estimating  the  relative  geo¬ 
potential  without  solving  the  inverse  problem  for  the  vertical  temperature 
distribution. 

Let  AT  =  T  -  E(T),  AL  *  L  -  E(L)  and  AH°(p)  be  the  vectors  of  deviations 
from  the  temperature  and  radiation  and  the  deviation  of  the  relative  geo¬ 
potential  of  a  surface  with  pressure  p,  respectively.  After  appropriate 
transformations,  the  following  approximate  algebraic  model  is  presented: 

(1)  AL  =  AAT 

(2)  AH° ( p )  =  [r(p)]TAT  , 
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where  At  *  (AT(p  ) . AT(p  ))T,  Al  -  (AL(v  ) , . . . ,AL(v  ))T  ,  A  is  an  m  *  n 

-  n  l^i  m 

matrix  and  r(p)  a  vector  of  known  constants. 

The  satellite  observations  are  described  by  the  model 
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where  e  is  the  error  of  measurements  and  the  model  used.  The  authors  assume 
that  c  has  a  multivariate  normal  distribution  with  mean  0  and  a  known 
covariance  matrix  K2 .  The  best  linear  unbiased  estimate  of  AH°  is  given  by: 

AH  *  q  AL  , 

*  —  -  -X  ~  ~  T 

where  q  *  (I  -  Kg  K£)(A  )  r  ,  AL  =  L  -  L  ,  and  K£  is  the  positive  square 
2  — 

root  of  ,  K  ■  K£(I  -  AA  )  and  -  is  the  notation  for  pseudo  inverse 

operation.  The  above  estimate  coincides  with  the  ordinary  least  squares 

estimate  (OLSE)  for  which  q  =  (A  )^r  if  and  only  if  K  K  =  K  K  .  The 

2  2 

authors  assume  in  their  analysis  that  =  a  I  which  guarantees  the  OLSE  q. 

The  general  solution  to  (1)  is: 

AT  =  A~  AL  ♦  s  , 

where  AT  =  T  -  T,  AL  =  L  -  L,  sis  any  vector  in  the  null  space  of  A. 
Similarly,  one  obtains 

AH°  -  AH°  -  rTPAT  -  rTA~e  , 
where  s  *  PaT  and  P  *  I  -  A  A.  Moreover, 

o2(AH°)  -  rTPK^,Pr  +  rTA~K2(A_)Tr  , 

2  ''22 
where  is  the  covariance  matrix  of  AT.  If  K£  *  o  I,  then  the  above 


formula  reduces  to 


o2(AH°)  =  rTPK^Pr  +  o2rT(ATA)“r, 


The  authors  proceed  to  investigate  the  problem  of  selecting  the  spectral 
intervals  that  will  minimize  o2(Ah°). 


In  [64]  The  authors  assume  that  the  input  information  for  analysis  of 
the  random  temperature  field  x  consists  not  only  of  aerological  data,  but 
also  of  remotely  sensed  data.  For  some  coordinates ,z^ , ,z^  on  Earth 


surface,  the  deviations  of  x(z.)  from  their  means  E[x(z.)]  are  related  to 

_  .1  -  -l 

the  variations  of  the  measured  quantities  (some  aerological  and  some  re¬ 
motely  sensed)  Y(z^)  from  their  means  E[Y(z^)]  via  the  statistical 
model : 


(1) 


Y(z)  -  E[Y(z)]  *  A(z){x(z)  -  E[x(z)]}+  e. 


In  (1)  A(z)  is  the  matrix  that  approximates  the  integral  operator  of  the 
transfer  equations  and  e(z)  is  a  random  error.  Given  the  observed  data 
at  z^,...,zn  the  authors  are  interested  in  estimating  the  meteorological 


variable  x  at  some  point  z  *  z^.  The  authors  point  out  that  the  matrix 


A  can  be  ill  conditioned  which  presents  difficulties  for  estimating  x(z^). 


They  assume  that  the  sample  mean  x(z^)  and  the  sample  covariance  matrix  of 


x(z.),  R(z.),and  the  sample  cross  covariance  matrix,  R(z.z.),  have  been 

-  - 1  -I  r 


T 

obtained  from  an  a  priori  data.  Moreover,  E[e(z)]  “  0,  E[e(z)e(z')  ]  ■  0 

T 

for  z  f  z',  and  R^  is  the  sample  estimate  of  E[c(z)e(z  )  ] .  Hence 

_  _  *"  «r 

Y(z)  ■  A(z)x(z)  and  Ry(z)  *  A(z)R(z)A  (z)  +  R^(z).  The  estimate  of  xCz^) 
that  minimizes  the  mean  square  error  is  given  by: 


-1, 


x(zQ)  *  x(zQ)  +  RjtY1^  ■  V 


where 


{R„}..  *  A(r.  )R(z. ,z . )A  (z.)  +  6..R  (z.)  , 
:  i]  -I  -i  ~j  ~j  ij  t  -l 


and 


RxY  "  ^R^5o’?1^aT^z1^ . R(z0,zn)AT(zn)] 


and  ■  1  if  i  *  j  and  0  otherwise.  The  covariance  matrix  of  x(z q)  is 


estimated  by 


The  authors  state  that  the  inversion  of  is  quite  difficult  because  n  is 


large  and  they  use  a  recursive  algorithm  to  overcome  this  difficulty. 
The  size  of  the  sample,  n,  is  called  a  filter  of  order  n  and  the 

A  * 

estimation  procedure,  x(zq)  *  x(z^|z^ . zn),  if  z^  t  z  ^ ,  j  *  1 . . 


is  called  optimum  extrapolation  if  n  =  1  and  optimum  interpolation  if 


n  >  1.  When  all  n  observations  are  taken  successively  at  (i.e.. 


z.  ■  Zq,  j  *  l,...,n),  then  the  estimation  procedure  is  called  spatial 
adjustment  (filtration).  The  performance  of  these  procedures  are  evaluated 
in  the  paper.  The  authors  have  studied  how  the  size  of  the  filter  and  the 
actual  location  of  the  points  z  , ,z^  in  relation  to  z^  affect  the 
accuracy  of  the  estimation  procedure.  Also  investigated  was  the  effect 
of  combining  the  serological  data  with  remotely  sensed  data  in  the  estimation 
procedure . 


g.  Spatial  Analysis  ([20],  [22],  [34],  [35],  [36],  [45],  [46],  [48], 
[61],  [62],  [64],  [72],  [76],  [77]). 

In  [34]  the  author  describes  the  three-dimensional  structure  of  the 
ocean-atmosphere  radiance  field  that  has  been  obtained  by  measuring  reflected 
solar  radiation  in  the  visible  range  (.4  -  .7  ym)  at  two  heights  (.3  and 
10  km)  in  the  atmosphere. 

The  author  calculates  the  spatial  radiance  profiles  itCX^),  i  *  1,2,..., 
in  individual  spectral  bands  of  .01  vim  in  the  .4  -  .7  urn  range.  From  these 
calculations  it  is  deduced  that  the  radiance  of  the  ocean-atmosphere  system 
is  a  random  function  of  the  spatial  coordinates  and  time.  The  variability 
of  the  atmospheric  optical  parameters  is  quite  high.  The  effect  of  the 


atmosphere  on  I  (X^)  is  most  noticeable  at  the  height  of  10  km.,  where  the 
maximum  spread  of  the  spectra  for  some  values  of  X^  reaches  75%.  There  is 
a  strong  correlation  of  large  radiance  variations  at  each  of  the  measurement 
heights  for  different  wavelengths  xk- 

The  author  assumes  that  the  spatial  structure  of  the  radiance  field 
of  the  ocean  surface  or  the  ocean-atmosphere  system  is  stationary  and 

therefore  is  characterized  by  the  autocorrelation  function  R  (p).  These 

I>, 

functions  are  plotted  and  examined  for  the  two  heights  and  several  wave¬ 
lengths.  By  comparing  the  autocorrelation  function  at  the  beginning,  middle, 
and  end  of  the  .4  -  .7  pm  band,  the  author  studies  the  variation  over  the 
spectrum  of  contributions  to  the  radiance  field. 

The  spectral  densities,  S  (u>),  are  calculated  as  well  and  are  used  in 

h 

the  investigation  of  the  nonuniformities  in  the  spatial  structure  of  the 
radiance  field. 


In  this  article,  [77],  the  authors  simulated  Goudrian's  theoretical 

model  for  plant  canopy.  The  plant  canopy  is  modeled  by  a  horizontally  infinite 

layer  whose  bioelements  have  a  Poisson  spatial  distribution.  The  layer  consists 

of  m  sublayers  L  thick  each.  There  is  no  mutual  shadowing  of  bioelements 
s 

within  these  sublayers.  The  azimuthal  distribution  of  bioelements  is 
assumed  to  be  uniform,  hence  their  orientations  are  defined  by  their  inclinations 
3^.  Functions  ffe^)  are  defined  for  nine  10°  sectors  in  the  range  0  -  90°, 
such  that  T  f(0. )  *  1.  The  average  projection  of  the  entire  layer  L  onto 

V1 

the  direction  along  the  zenith  angle  0  is 

8 

9 

E[G(©  )]  -  I  f(£u)G(£  ,£  )  , 

8  0-1  b  8  b 
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where  G(0g,0^)  is  Che  azimuth-averaged  projected  leaf  surface  inclined  by 
0.  into  Che  0  direction.  The  contribution  at  each  10°  sector  to  the  total 

D  S 

irradiance  of  the  horizontal  surface  is  given  by  a  symmetric  weight  function 

B.,(0  ).  The  fraction  of  the  leaf  surface  area  that  interacts  with  the 

U  a 

radiation  within  the  L#  layer  is 

M. (0  )  «  L  E[G(0  )]/sin(0  )  . 

IS  s  s  s 

Based  on  these  quantities  and  several  other  assumptions,  the  Goudrian  model 
specifies  the  fraction  of  radiation  attenuated  by  the  jth  layer. 

The  authors  state  that  the  assumption  of  a  random  spatial  distribution 
of  the  bioelements  restricts  the  application  of  Goudrian1 s  model.  The  model 
is  reasonable  for  vegetation  canopies  that  have  no  well  ordered  structure, 
like  crops  and  meadows.  The  authors  compared  measured  and  predicted  (from 

I 

the  simulation)  reflectances  for  winter  wheat  fields.  The  results  support 
the  applicability  of  Goudrian's  model. 

In  the  simulation  of  Goudrian's  model  to  study  the  reflectance  b^ ,  the 
effect  of  the  following  two  parameters  was  examined  in  detail. 

(1)  Leaf  area  index,  that  characterizes  soil  coverage  by  bioelements 
and  is  related  to  percentage  of  vegetative  cover. 

(2)  Leaf  orientation. 

h.  Spectral  Analysis  ([51,  [20],  [22],  [33],  [34],  [38],  [57],  [59], 
[63],  [72],  [76]). 

In  [48]  the  authors  analyze  the  fluctuations  of  the  radiative  components 
of  the  ocean-atao sphere  system  and  the  ocean  surface  with  respect  to  monthly 
averages  of  the  radiation  budget. 


The  radiation  budget  of  the  Earth's  surface  is  determined  from  the 


following  transfer  equation: 


I(z,v) 


l(z,y) 


,  2n+lf  ip  (z)  ^ 

IT  i  • 

n  0  l  tn(2)  j 


where  I(z,y)  and  l(z,y)  are  fields  of  the  shortwave  and  longwave  radiation, 

z  being  the  vertical  coordinate,  y  the  cosine  of  the  zenith  angle,  (<P  ,ip  ) 

n  n 

are  vectors  of  the  intensity  angular  distribution  moments  and  are 
Legendre  polynomials .  It  follows  that  the  ground  radiation  budget  B^  is 
given  by: 

B  ■  <{i,(z  *  0)  +  il),(z  *  0)  . 

g  1  1 

The  values  of  and  i|>-  are  tabulated  for  various  models  of  the  atmosphere. 

The  effect  of  the  particular  models  in  determining  B  from  satellite  data 
is  very  small  because  of  the  process  of  averaging  the  data  over  time  and 
experimental  conditions. 

The  authors  consider  year-to-year  variations  of  the  two  radiative 
components  in  an  analysis  of  measurements  in  North  Atlantic  for  the  period 
of  45  months  in  1974-1978.  Their  measure  of  variation  is  the  variances  of 
the  entire  45  month  data  with  respect  to  the  monthly  averages  within 
2 . 5°-lattitude  by  2 . 5°-longitude  grid.  Figures  with  isolines  for  the  monthly 
averages  of  the  radiation  budget  are  presented  and  carefully  analyzed.  Zones 
of  significant  year-to-year  variations  of  radiation  budget  were  identified. 

To  study  the  spatial  distribution  of  the  effect  of  zones  of  significant 
year-to-year  radiation,  the  authors  employ  a  two-dimensional  Fourier  analysis 
of  the  fields  of  the  radiation  budget  and  its  components.  Among  other  things, 
the  authors  estimate  the  spectral  density  and  the  correlation  function  of  the 
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anomalies  of  the  two-dimensional  radiation  budget  field.  By  analyzing  the 
spectral  density  function,  the  authors  explain  the  structure  of  the  spatial 
variability  (two-dimensional  variance).  Principal  component  analysis  is 
performed  on  the  correlation  matrices  and  the  radiation  budget  anomalies  are 
explained  in  terms  of  a  few  principal  components. 

The  following  topics  were  not  discussed  in  this  section  because  of  their 
elementary  nature:  Analysis  of  Variance  ([30]),  Optimal  Design  ([26],  [53]) 
and  Principal  Component  Analysis  ([1],  [28]). 


4.  SUMMARY 

In  conclusion  of  this  survey  article,  I  would  like  to  stress  again  the 
fact  that  the  following  three  journals  give  a  pretty  complete  account  of  the 
statistical  techniques  used  by  Russian  scientists  in  remote  sensing: 

Soviet  Journal  of  Remote  Sensing,  Izvestiya,  Atmospheric  and  Oceanic  Physics, 
and  Soviet  Meteorology  and  Hydrology. 

It  becomes  clear  from  this  survey  that  almost  all  the  areas  of  statistics 
have  been  used  in  the  analysis  of  remotely  sensed  data.  There  are  quite  a  few 
statistical  techniques  that  have  not  been  used,  especially  those  that  represent 
the  advances  in  the  last  ten  to  fifteen  years.  For  example,  no  modern 
exploratory  data  analysis  techniques  (graphics,  outlier  detection,  regression 
diagnostics,  etc.),  robust  statistical  methods  or  sample  reuse  techniques 
(bootstrap,  jackknife)  have  been  used  at  all.  On  the  other  hand,  the  mathe¬ 
matical  models  that  govern  the  observed  processes  are  quite  complex,  as  are 


the  instrumentation  that  is  used  in  collecting  the  data.  That  is  why  the 
author  of  this  article  believes  that  there  is  a  need  for  collaboration  between 
the  statisticians  and  the  scientists  that  study  remotely  sensed  data. 
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